Aggregation and Association in Cross Tables

نویسندگان

  • Gilbert Ritschard
  • Nicolas Nicoloyannis
چکیده

The strength of association between the row and column variables in a cross table varies with the level of aggregation of each variable. In many settings like the simultaneous discretization of two variables, it is useful to determine the aggregation level that maximizes the association. This paper deals with the behavior of association measures with respect to the aggregation of rows and columns and proposes an heuristic algorithm to (quasi-)maximize the association through aggregation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Partial Association Components in Multi-way Contingency Tables and Their Statistiical Analysis

In analyses of contingency tables made up of categorical variables, the study of relationship between the variables is usually the major objective. So far, many association measures and association models have been used to measure  the association structure present in the table. Although the association measures merely determine the degree of strength of association between the study varia...

متن کامل

سری آمار: تحلیل جداول توافقی 2 (شاخص‌های بررسی رابطه)

The P-Value cannot present a complete measure of association in medical studies considering the association between categorical variables. In such situations, measures are required to reveal the clinical importance of relation along with their statistical significance, as the effect size. This paper aims to introduce the measures of associations for categorical variables and inferences ab...

متن کامل

Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms

This paper reports on an approach and experiments to automatically build a cross-lingual multi-word entity resource. Starting from a collection of millions of acronym/expansion pairs for 22 languages where expansion variants were grouped into monolingual clusters, we experiment with several aggregation strategies to link these clusters across languages. Aggregation strategies make use of string...

متن کامل

Models for the probability of concordance in cross-classification tables

For cross-classification tables having an ordinal response variable, logit and probit models are formulated for the probability that a pair of subjects is concordant. For multidimensional tables, generalized models are given for the probability that the response at one setting of explanatory variables exceeds the response at another setting. Related measures of association are discussed for two...

متن کامل

Pnm-4: The Association of Depression and Fetal Sex in Pregnant Women with Sleep Disorder

Background: 79% of the pregnant women suffer from sleep disorders and can affect disorders before, during and after childbirth can be involved in causing depression during pregnancy, according to studies the poor quality of sleep in the second trimester of pregnancy is directly related to depressive symptoms in late pregnancy,Also gender difference may have a role in depression. This study aime...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000